skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Wheeler, Andrew"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. The current “consensus” order in which amino acids were added to the genetic code is based on potentially biased criteria, such as the absence of sulfur-containing amino acids from the Urey–Miller experiment which lacked sulfur. More broadly, abiotic abundance might not reflect biotic abundance in the organisms in which the genetic code evolved. Here, we instead identify which protein domains date to the last universal common ancestor (LUCA) and then infer the order of recruitment from deviations of their ancestrally reconstructed amino acid frequencies from the still-ancient post-LUCA controls. We find that smaller amino acids were added to the code earlier, with no additional predictive power in the previous consensus order. Metal-binding (cysteine and histidine) and sulfur-containing (cysteine and methionine) amino acids were added to the genetic code much earlier than previously thought. Methionine and histidine were added to the code earlier than expected from their molecular weights and glutamine later. Early methionine availability is compatible with inferred early use of S-adenosylmethionine and early histidine with its purine-like structure and the demand for metal binding. Even more ancient protein sequences—those that had already diversified into multiple distinct copies prior to LUCA—have significantly higher frequencies of aromatic amino acids (tryptophan, tyrosine, phenylalanine, and histidine) and lower frequencies of valine and glutamic acid than single-copy LUCA sequences. If at least some of these sequences predate the current code, then their distinct enrichment patterns provide hints about earlier, alternative genetic codes. 
    more » « less
    Free, publicly-accessible full text available December 24, 2025
  2. Nearly neutral theory predicts that species with higher effective population size (N_e) are better at purging slightly deleterious mutations. We compare evolution in high N_e vs. low-N_e vertebrates to reveal subtle selective preferences among amino acids. We take three complementary approaches. First, we fit non-stationary substitution models using maximum likelihood, comparing the high-N_e clade of rodents and lagomorphs to its low-N_e sister clade of primates and colugos. Second, we compared evolutionary outcomes across a wider range of vertebrates, via correlations between amino acid frequencies and N_e. Third, we dissected which amino acids substitutions occurred in human, chimpanzee, mouse, and rat, as scored by parsimony – this also enabled comparison to a historical paper. All methods agree on amino acid preference under more effective selection. Preferred amino acids are less costly to synthesize and use GC-rich codons, which are hard to maintain under AT-biased mutation. These factors explain 85% of the variance in amino acid preferences. Parsimony-induced bias in the historical study produces an apparent reduction in structural disorder, perhaps driven by slightly deleterious substitutions in rapidly evolving regions. Within highly exchangeable pairs of amino acids, arginine is strongly preferred over lysine, aspartate over glutamate, and valine over isoleucine, consistent with more effective selection preferring a marginally larger free energy of folding. Two of these preferences (K→R and I→V), but not a third (E→D) match differences between thermophiles and mesophilic relatives. These results reveal the biophysical consequences of mutation-selection-drift balance, and demonstrate the utility of nearly neutral theory for understanding protein evolution. 
    more » « less
  3. Hobert, O (Ed.)
    Abstract Hox transcription factors are conserved regulators of neuronal subtype specification on the anteroposterior axis in animals, with disruption of Hox gene expression leading to homeotic transformations of neuronal identities. We have taken advantage of an unusual mutation in the Caenorhabditis elegans Hox gene lin-39, lin-39(ccc16), which transforms neuronal fates in the C. elegans male ventral nerve cord in a manner that depends on a second Hox gene, mab-5. We have performed a genetic analysis centered around this homeotic allele of lin-39 in conjunction with reporters for neuronal target genes and protein interaction assays to explore how LIN-39 and MAB-5 exert both flexibility and specificity in target regulation. We identify cis-regulatory modules in neuronal reporters that are both region-specific and Hox-responsive. Using these reporters of neuronal subtype, we also find that the lin-39(ccc16) mutation disrupts neuronal fates specifically in the region where lin-39 and mab-5 are coexpressed, and that the protein encoded by lin-39(ccc16) is active only in the absence of mab-5. Moreover, the fates of neurons typical to the region of lin-39-mab-5 coexpression depend on both Hox genes. Our genetic analysis, along with evidence from Bimolecular Fluorescence Complementation protein interaction assays, supports a model in which LIN-39 and MAB-5 act at an array of cis-regulatory modules to cooperatively activate and to individually activate or repress neuronal gene expression, resulting in regionally specific neuronal fates. 
    more » « less